Symbiosis in the Intranet: How Document Retrieval Benefits from Database Information

Authors

  • Christoph Mangold
  • Holger Schwarz
  • Bernhard Mitschang

Abstract

The enterprise information space is split into two hemispheres. Documents contain unstructured or semistructured information; structured information is stored in databases. In terms of content, the two kinds of information are complementary. However, enterprise information systems usually focus on only one of them. Our approach improves document retrieval in the intranet by exploiting the enterprise's databases. In particular, we use database information to describe the context of documents and exploit this context to enhance common full-text search. In this paper, we show how to model and compute document context and present results on runtime performance.

Similar resources

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: This research investigates the impact of authors' rank in bibliographic networks on the document-centered model of expertise retrieval. Its purpose is to find out which kinds of author ranking in bibliographic networks can improve the performance of the document-centered model. Methodology: The current research is experimental. To operationalize research goals, a new test colle...

Document Analysis And Classification Based On Passing Window

In this paper we present a document analysis and classification system that segments and classifies the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing, and detecting and correcting image skew. In document segmentation, an algorith...

Efficient Real-time Index Updates in Text Retrieval Systems

As information retrieval (IR) systems emerge as the mainstream information finding tool within commercial enterprises due to the enormous popularity of World Wide Web (WWW) technology in intranet environments, the ability to incorporate new and/or updated documents into the database in real time becomes an essential requirement. However, conventional IR systems are optimized for read queries ...

A UMLS-based method for integrating information databases into an Intranet

The Internet and the World Wide Web today give end-users universal access to information in various heterogeneous databases. The biomedical domain benefits from this new technology, especially for information retrieval by searching and browsing various sites. Nevertheless, end-users may be disoriented by the specific ways of accessing information on different servers. In t...

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes that generate a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval systems are fundamentally based on Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

Publication date: 2006